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ABSTRACT 

The Red MSX Source (RMS) survey has identified a large sample of massive young stellar 
objects (MYSOs) and ultra compact (UC) HII regions from a sample of ~2000 MSX and 
2MASS colour selected sources. Using a recent catalogue of molecular clouds derived from 
the Boston University-Five College Radio Astronomy Observatory (BU-FCRAO) Galactic 
Ring Survey (GRS), and by applying a Galactic scaleheight cut off of 120pc, we solve the 
distance ambiguity for RMS sources located within 18° < |/| > 54°. These two steps yield 
kinematic distances to 291 sources out of a possible 326 located within the GRS longitude 
range. Combining distances and integrated fluxes derived from spectral energy distributions, 
we estimate luminosities to these sources and find that > 90 per cent are indicative of the 
presence of a massive star. We find the completeness limit of our sample is ~1O 4 L , which 
corresponds to a zero age main sequence (ZAMS) star with a mass of ~12M . Selecting only 
these sources, we construct a complete sample of 196 sources. 

Comparing the properties of the sample of young massive stars with the general popu- 
lation, we find the RMS-clouds are generally larger, more massive, and more turbulent. We 
examine the distribution of this sub-sample with respect to the location of the spiral arms and 
the Galactic bar and find them to be spatially correlated. We identify three significant peaks 
in the source surface density at Galactocentric radii of approximately 4, 6 and 8 kpc, which 
correspond to the proposed positions of the Scutum, Sagittarius and Perseus spiral arms, re- 
spectively. Fitting a scale height to the data we obtain an average value of ~ 29 + 0.5 pc, which 
agrees well with other reported values in the literature, however, we note a dependence of the 
scale height on galactocentric radius with it increases from 30 pc to 45 pc between 2.5 and 
8.5 kpc. 

Key words: Stars: formation - Stars: early-type - ISM: clouds - Galaxy: kinematics and 
dynamics. 



1 INTRODUCTION 

Throughout their lives massive stars (M, > 8M Q ) play an im- 
portant role in many aspects of astrophysics including many of 
the most energetic phenomena in the Universe. They deposit huge 
amounts of energy into the interstellar medium in the form of UV 
radiation, which leads to the creation of HII regions, and kinetic en- 
ergy via molecular outflows, powerful stellar winds, and supernova 
explosions. Massive stars are also responsible for the production 
of enriched material that alters the local chemistry. These feedback 

* E-mail: James.Urquhart@csiro.au (ATNF) 



processes play an important role in regulating star formation within 
the surrounding environment, possibly triggering the formation of 
future generation s of stars, and ul timately driving the evolution of 
their host galaxy dKennicuttll2005l) . 

Given the profound impact massive stars have, not only on 
their local environment, but also on a Galactic scale, it is crucial to 
understand the environmental conditions and processes involved in 
their birth and the earliest stages of their evolution. The life cycle 
of massive stars once they have emerged from their natal molecular 
cloud is generally thought to be understood. However, the processes 
involved in their formation and the early stages of their evolution 
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are still shrouded in mystery, with our understanding lagging be- 
hind that of low-mass star formation. 

Massive stars are much rarer than their low-mass counterparts 
and consequently are generally located much farther away than 
regions of low-mass star formation. Furthermore, massive stars 
evolve extremely rapidly, reaching the main sequence while still 
deeply embedded within dense cores. The formation of massive 
stars and the earliest stages of their evolution are therefore hidden 
behind many magnitudes of visual extinction, only observable at 
infrared and millimetre wavelengths. Moreov er, massive stars ar e 
known to form exclusively in clusters (e.g.. Ide Wit et alj|2004h . 
which makes it difficult to attribute derived quantities to individ- 
ual sources. Consequently, the known samples are too small to test 
many aspects of massive star formation theories, and due to the var- 
ious selection criteria, are probably unrepresentative of the class as 
a whole. 

We have conducted a new galaxy-wide search (the Red MSX 
Source, or RMS surve)!]) with the specific aim of identifying a large 
well selected sample of massive young stellar objects (MYSOs). 
Observationally, a MYSO is defined as a stellar object which is 
mid-IR bright and processes sufficient luminosity to form a mas- 
sive star (greater than a few times 10 3 L Q ) but shows no evidence 
that a HII region has begun to form (i.e., no radio emission and no 
spatially extended mid-infrared emission). To place MYSOs in an 
evolutionary context they are somewhat later than the hot molec- 
ular core s tage, which is not gen erally found to be mid-infrared 
bright (e.g jDe Buizer et al ■|2002[) . but have not yet begun to ionize 
their surroundings sufficiently to produce a detectable HII region. 
Although our efforts are directed towards identifying MYSOs we 
are also sensitive to the slightly later ultra-compact (UC)HII re- 
gion stage. It is unclear whether core hydrogen burning has be- 
gun in the MYSO phase, however, their association with strong 
molecular outflows indicates accretion is ongoing. As the MYSO 
evolves and moves towards the main sequence it will begin to form 
an HII region which will eventually halt accretion at which point 
the final mass of the star is set. The MYSO and UC HII stages are 
two of the earliest and most important in the evolution of massive 
stars since they encompass the transition between accretion and UV 
dominated regimes. 

Before we can determine each source's luminosity, and thus, 
confirm the presence of a massive young star we need to esti- 
mate its heliocentric distance. Kinematic distances can be derived 
using the sourc e velocities in conjunction with a Galactic rota- 
tion m odel (e.g jBrand & Blitzll993r . lAlvarez et al.ll99d : IClemensl 
Il985h . However, for sources located within the solar circle (Galac- 
tocentric radii < 8.5 kpc) there are two possible kinematic distances 
for any given velocity, equally spaced on either side of the tangent 
point, commonly referred to as the near and far distances. Kine- 
matic distance ambiguities affect approximately ~70 per cent of 
our sample and therefore represent a serious hurdle that needs to be 
addressed before many of the important source parameters can be 
calculated. 

In this paper we assign kinematic distances to ~300 young 
massive star candidates located in the northern Galactic plane by 
correlating their spatial positions and velocities with a catalogue of 
molecular clouds, derived from 13 CO data from the Galactic Ring 
Survey (GRS), for which the near-far distance ambiguities have 
been solved. In the next section (Sect. 2) we give a brief overview of 
both the RMS and GRS surveys and describe the procedure used to 
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identify associations. In Sect.[3]we present a summary of the RMS- 
cloud associations, assign distances, and calculate luminosities for 
RMS sources. In Sect. 4 we derive the physical sizes and masses 
of all GRS clouds and compare the properties of those associated 
with a sub-sample of the most luminous RMS sources with those 
that are not currently forming massive stars. We discuss the Galac- 
tic distribution of our sample of young massive stars with respect 
to the positions of the spiral arms and Galactic bar in Sect.[5] In 
Sect.[6]we present a summary of the results and our conclusions. 



2 SURVEY DESCRIPTIONS AND DATA SETS 
2.1 RMS Survey 

The RMS survey is based on a se arch of the MS X mid- 
infrared survey of the Galactic Pl ane i Price et alj 200lh . Using 
the MSX Point Source Catalogue ( lEganetalJl2003h " we colour- 
selected a large samp le of candidate MYSOs and UCHII regions 
dLumsden elal]|2002l) . This initial sample was further refined us- 
ing near-infrared photometry obtained from 2MASS dCutri et all 
l2003l : lskrutskie & et al.ll200q) to eliminate blue objects, and from 
visual inspection of the MSX images to remove the more extended 
sources. Our colour-selection and su bsequent filtering pro duced a 
sample of -2000 MYSO candidates dLumsden et ai1l2002l) . 

The RMS Survey is a multi-wavelength programme of follow- 
up observations designed to identify genuine MYSOs and UCHII 
regions. These observations have been designed to identify and re- 
move other kinds of embedded or dusty objects such as planetary 
nebulae, evolved stars and nearby low-mass YSOs dHoare et al .1 
l2005l : IUrauhart et al.ll2008r) . To date we have used arcsecond res- 
olution mid-infrared ima ging from the Spitzer GLIMPSE survey 
dBenia min & et al. 2003]) or our own ground-based imaging (e.g., 
iMottram et al J 120071) to reveal multiple and/or extended sources 
within the MSX beam, as well as MYSOs in close proximity to 
existing HII regions. We have obtained arcsecond r esolution radio 
continuum i mages with the ATCA and the VLA dUrquhart et al .1 
l200l l2009h to identify UC HII regions and PNe whilst obser- 
vations of 13 CO transitions dUrquhart et all 120071 120081. 120091) 
deliver kinem atic velocities. Fina lly, we use near-infrared spec- 
troscopy (e.g.. Iciarke et al.l l2006h to distinguish the more patho- 
logical evolved stars and confirm source classifications. 

By combining these infrared, millimetre and radio wavelength 
observations with complementary archival data we have identified 
approximately 1300 UCHII regions a nd candidate MYSOs located 
throughout the Galactic plane (b <5 °: IUrquhartetai]|2008h . 



2.2 The BU-FCRAO Galactic Ring Survey 

The 13 CO (7=1-0) Galactic Ring Survey (GRS) covers a total area 
of 75.4 deg 2 between 18°< / <55.7° and \b\ <1° and is fully sam- 
pled with a pixel size of 22" . The processed data cubes with Galac- 
tic longitudes I < 40° have a Vlsr range = -5 to 135 km s _I , while 
those with / > 40° have V L sr range = -5 to 85 km s -1 . The spectral 

resolution for both velocity r anges is 0.21 km s' 1 . 

In a recent paper, Rathborne et al .1 1 20091) used the 
CLUMPFIND algorithm dWilliams et al] 1 19941) to analyse 
this homogeneous, fully sampled data set, resulting in the identi- 
fication of 829 molecular clouds distributed t hroughout the GRS 
region . This work was complemented by iRoman-Duval et all 
d2009h who use d archival HI da ta from the VLA Galactic Plane 
Survey (VGPS; Istil et alj[20o3) to break the near-far kinematic 
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distance ambiguity towards 750 molecular clouds (~90 per cent 
of all clouds identified). The combination of a complete set of 
molecular clouds and kinematic distances provides an enormously 
important catalogue that can be used for determining distances 
to star-forming regions, allowing their distribution, luminosity 
functions, etc., to be mapped with respect to the large-scale 
Galactic structure. 

2.3 Resolving the Kinematic Distance Ambiguity 

Obtaining heliocentric distances to our sources is a crucial element 
of our campaign as without these we cannot accurately calculate 
source luminosities, and thus, distinguish between nearby low and 
intermediate-mass YSOs from the more generally more distance 
MYSOs. As mentioned in the introduction kinematic distance am- 
biguities affect a large proportion of our sample (~900 sources in 
total) and these need to be resolved before we obtain a robust sam- 
ple of young massive stars. 

There have been a number of studies dev oted to r esolvi ng dis- 
tances ambiguities using a variety of methods. iDamd ( fl99 lb found 
correlations and anti-correlations between near-infrared extinction 
and molecular clouds traced by CO emission could be used to re- 
solve the ambiguity towards molecular clouds. This method worked 
by assuming that the anti-co rrelation was the resu lt of dust associ- 
ated with foreground clouds. fPaladini et alj J2004 b used a statistical 
approach to derive a luminosity-p h ysical diame t er correlatio n for 
HII regions iDownes et ail jl980h . [Arava et all i bOOll |2002|) and 
ISewilo et all d2004l) have used a combination of HI 10a radio re- 
combination lines (RRLs) and formaldehyde (H2CO) absorption 
measurements to assign near and far solutions for HII regions. 

Although these studies have had reasonable success in resolv- 
ing ambiguities they tend to be used only for a particular type of ob- 
ject and/or require a specific set of conditions and so are not univer- 
sally applicable. Another technique that has been applied success- 
fully to both UCHII regions and YSOs combines 21 cm HI absorp- 
tion with a molecular tracer such as 13 CO and implicitly assumes 
that the source is still embedded within its natal molecular cloud. 
For HII regions this technique relies on the principle that if a con- 
tinuum source is located at the near distance, the HI data will not 
show any absorption from clouds with velocities between that of 
the HII region and the tangent point, since these are located behind 
the continuum source with respect to our line of sight. If absorp- 
tion is present in the HI data at distances between the continuum 
sources and the velocity of th e tangent point the source must be lo- 
cated at the far distance (e.g. jKuchar & Baniall 19941 iKolpak et al.1 
120031) . For YSOs this technique looks for HI self-absorption (SA) 
at the same velocity as the source and works on the principle that if 
the source's host cloud is located at the near distance, it would lie in 
front of a significant column of warmer HI, resulting in absorption 
by cold HI associated with the cloud. Conversely, the absence of an 
abso rption dip would imply the source is locat ed at the far distance 
(e.g jjackson et al.ll2002HBusfield et alj2006h . 

In this study we will focus on the sub-sample of UC HII re- 
gions and MYSO candidates located within the GRS longitude 
range. Combining these two data sets has the potential to solve 
the distance ambiguities towards approximately one-third of our 
inner-Galaxy sources, and 75 per cent of the sources located in the 
Northern Galactic Plane. The distance ambiguities in this region 
have been solved using a combination of the two HI absorption 
techniques described in the previous paragraph for a flux-limited 
sample of molecular clouds jRoman-Duval et al .l2009h . Both meth- 
ods have been used for molecular clouds that are also found to be 



associated with a HII region with the results of both techniques be- 
ing check to ensure consistency between the two. The distance so- 
lutions have been checked with many of the previous surveys and 
the results are found to be in reasonable agreement. Therefore the 
GRS catalogue provides robust distances to a complete sample of 
molecular clouds located within the first quadrant of the Galactic 
plane. 



2.4 Matching RMS sources with GRS clouds 

In the Northern Galactic Plane we have 427 RMS sources located 
within the solar circle that are affected by these kinematic distance 
ambiguities. Searching this sample we have identified 306 young 
massive stars (UC HIIs and MYSO candidates) located within the 
GRS region. In order to obtain kinematic velocities for these, we 
extracted spectra from the GRS data cubes and fitted them with 
Gaussian profiles. In cases where more than one significant emis- 
sion peak was present in the CO spectrum, we have used archival 
mas er or high-density ga s tracers to determine the source velocity 
(see lUrouhart et aHl2008l for more details). In total, we have been 
able to assign a unique velocity to 300 MYSO candidates and com- 
pact HII regions within the GRS region. 

By comparing the Galactic longitudes and latitudes and the 
velocities of the RMS sources wi th those derived for the clouds 
reported bv lRathborne et al.ld2009l) we are able to identify the pop- 
ulation of clouds that are responsible for giving birth to the next 
generation of massive stars in the Galaxy. To find a match we began 
by searching within the GRS in a small region around the coordi- 
nates of each RMS source. This region was 5 x 5 x 11 resolution 
elements in the / x b x v directions corresponding to 15 arcmin in 
Galactic longitude and latitude and 4.67 kms~' in velocity. This 
search returned a unique cloud match for 116 RMS sources. For a 
further 175 sources, multiple clouds were found within the search 
radius. In the majority of these case there were only a few pixels 
from a neighbouring cloud and, to determine a match, we selected 
the cloud that contributed the most pixels within the search region. 

We failed to find any cloud association for fifteen of the 306 
RMS sources; we present a list of these sources in TableQ] Of 
these, the GRS molecular line observations towards two failed to 
detected any significant level of emission in the observed veloc- 
ity range. The CO results towards four sources are inconclusive 
with multiple components with similar line strengths being de- 
tected and no other tracer is available. We note that the velocity 
assigned to G043.96 74+00.9938 was derive d from the CS obser- 
vations presented by iBronfman etafl dl996h and falls outside the 
velocity range covered by the GRS, which explains why we were 
unable to find a match for this particular source. The limited veloc- 
ity range covered by the GRS survey may also explain the lack of 
any detected CO emission towards the two sources towards which 
no CO emission is detected. 

This leaves eight sources with assigned velocities but for 
which no associated cloud has been identified. One possible ex- 
planation for this is that an incorrect velocity has been assigned to 
the RMS source. Another possibility is that the host cloud has not 
been detected by the CLUMPFIND algorithm. To investigate these 
possibilities we compared the 13 CO spectra with other molecular 
transitions reported in the literature towards these sources to check 
the velocity assignment was consistent wit h previous studies. For 
seven sources we find CS (7=2-1) emission ( IBronfman et alj 19961) 
at the same velocity as the 13 CO emission. The CS rotational tran- 
sition has a much higher critical density than the 13 CO transition, 
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Table 1. Summary of unmatched RMS sources located within the GRS re- 
gion. 



MSX Name 


Vlsr 


Near Dist. 


Far Dist. 


RGC 




(kms -1 ) 


(kpc) 


(kpc) 


(kpc) 


G019.8922+00.1023 


45.7 


3.6 


12.4 


5.3 


G025.3O58+OO.53O8 










G025.4948-00.2990 










G026. 1094-00.0944 










G027.2220+00.1361* 


112.8 


6.9 


8.2 


3.9 


G032.2718-00.2260 


22.3 


1.6 


12.8 


7.2 


G034.6243-00.1300 


23.0 


1.6 


12.4 


7.2 


G039.4943-00.9933t 


53.2 


3.6 


9.5 


6.2 


G039.865 1+00.6474 










G042.0977+00.3521 


20.7 


1.4 


11.2 


7.5 


G043.9674+00.9938t 


-19.7 


14.1 


14.1 


10.0 


G047.0998+00.4799 










G047.9002+00.0671* 


72.0 


5.3 


5.3 


5.9 


G052.7528+00.3343 


15.2 


1.0 


9.2 


7.9 


G053.9584+00.0317 


41.6 


4.1 


5.9 


6.9 



t Source assigned to the near distance as the far distance would imply a 
source location more than 4 times the Galactic scale height from the mid- 
plane and is thus unlikely. 

£ Source velocity lies outside the velocity range covered by the GRS. 

* This source is located within ~0.5 kpc of the tangent point and has been 

assigned the tangent point velocity. 
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Figure 1. Integrated 13 CO 7=1-0 emission in a 15'xl5' region around 
G019.8922+00.1023. The position of the RMS source is indicated by a 
cross. The image was produced by integrating the GRS cube over a ve- 
locity range of three times the velocity dispersion either side of the source's 
assigned velocity. The FCRAO beam size is shown in the bottom left hand 
comer. 

and thus, the presence of the two transitions at the same velocity 
makes it unlikely the assigned velocities are incorrect. 

Having confirmed that the velocities assigned to the RMS 
sources are likely to be correct we need to investigate why we have 
been unable to associate them with a molecular cloud. In order to 
investigate the spatial extent of the emission surrounding each of 
these sources we produced integrated velocity maps of the 13 CO 
emission around each source at the assigned velocity. In Fig.[T]we 
present an example of one of these integrated 13 CO maps showing 
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Figure 2. The number of RMS sources found in each molecular cloud. 

the emission coincident with RMS source G019.8922+00.1023 in 
both position and velocity. This reveals a relatively bright, com- 
pact region of emission coincident with the the RMS source. We 
find a similar situation for the other sources, however, the 13 CO 
emission in nearly all cases is quite weak (T* K ~ 0.4 - 1.2 K) and 
it is therefore likely that the emission associated with these small 
clouds fell below the c riteria used to generate the GRS catalogue 
dRafhborne et alj2009l) . 



3 DISTANCES AND LUMINOSITIES 

iRoman-Duval et alj j2009h used the lciemensl jl985h Galactic rota- 
tion model to determine the kinematic distances to the GRS sam- 
ple of clouds. However, to maintain co nsistency with previous pub- 
lished work on the RMS sample (i.e.. lUrquhart et alj[2007L | 2008h 
we have recalculated their distances using the lBrand&Blitzl l ll993h 
rotation model assuming a distance to the Galactic centre of 8.5 kpc 
and a solar velocity of 220kms~'. The difference in derived dis- 
tances between the two models is small (typically +0.3 kpc) com- 
pared to the error associated with kinematic distances, which are 
of order ± 1 kpc in this part of the Galaxy, and therefore the actual 
choice of rotation curve does not have any impact our findings. 

3.1 RMS-GRS cloud associations 

We have identified the molecular clouds associated with 291 
RMS sources. In Table[2] we present a summary of the GRS 
cloud parameters and the number of associated young mas- 
sive stars (only a small portion of the table is provided here, 
the full table is only available in electronic form at the CDS 
via anonymous ftp to cdsarc.u-strasbg.fr (130.79.125.5) or via 
http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/ l. Given the a ngular 
size of the molecular clouds identified by Rathborne ct al. (2009) 
we might expect to find multiple regions of star formation within a 
significant number of clouds and this is indeed the case. We find the 
RMS sources are associated with 204 clouds, with 53 clouds being 
found to be associated with two or more sites of star formation. 

In Fig.|2] we present a histogram of the number of RMS 
sources found in each cloud. The majority of clouds are associ- 
ated with a single RMS source, with the number of clouds falling 
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Table 2. Summary of GRS cloud parameters and RMS source associations. 
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Figure 3. Distribution of heights above the mid-plane as a function of 
Galactic longitude for RMS sources with assigned distances (grey) and the 
heights that correspond to the near and far distances (shown in red and blue 
respectively) for high latitude sources (i.e., \b\ > 1°). 



off rapidly as a function of RMS source density. We find two 
clouds that have an RMS source count o f nine (G043. 19-00.01 and 
G049.49-00.4h iRathborne et alj|200^) . Being associated with so 
many potential sites of massive star formation distinguishes these 
clouds as two of the most intense regions of massive star formation 
in this part of the Galaxy. G049.49-00.41 is part of the W49 giant 
molecular cloud, which is the most massive cloud in the Galaxy 
outside the Galactic centre Jsimon etal The second cloud, 

G043. 19-00.0 1, is coincident with a w ell known OH maser source 
(OH43.8-0. 1; lBraz & Epchtein|[l983l) and a variable H 2 maser 

dLekfatkOOd). 

iRoman-Duval et al. I J2009I) resolved the distance ambiguities 
to a hundred and eighty-five of the RMS-GRS cloud associations, 
which are themselves host to a total of 268 RMS sources. We are 
therefore able to assign kinematic distances to ~90 per cent of the 
matched RMS sources. Furthermore, examination of the remaining 
RMS sources in the GRS region without an assigned cloud iden- 
tified four sources for which a distance can be assigned (see su- 
perscripts and footnotes in Table[T]for individual source details). 
One of these (G043.9674+00.9938) has a velocity outside the GRS 
range that corresponds to a location outside the solar circle, and 
therefore is not subject to a kinematic distance ambiguity. An- 
other source (G025. 398 1+00.56 17) is assigned to the near distance 
through an analysis of its heights above the Galactic mid-plane 
(see following sub-section for details). Finally, we find two sources 
that are located within ~0.5 kpc of the tangent point and so have 
been assigned the distance of the tangent point. Including these 
four sources, we have resolved the distance ambiguities towards 
272 RMS sources. 



3.2 Distances to high latitude sources 

In the previous section we matched RMS sources with molecular 
clouds identified within the GRS and obtained distances to 272 
young massive stars. However, in order to obtain a fully repre- 
sentative sample we also need to resolve the distance ambigui- 
ties to sources located within the GRS longitude range but with 
\b\ > 1°. Although we have obtained 13 CO velocities towards a fur- 
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G021.84+00.59 


18.2 


5.96 


0.21 


3.23 


G021.84-00.26 


23.1 


10.54 


0.09 


4.44 


G021.89-00.56 


30.2 


9.37 


0.11 


4.92 


G02 1.94+00.04 


13.0 


6.60 


0.16 


3.56 


G02 1.94+00. 14 


34.8 


8.76 


0.11 


4.73 


G02 1.99-00.06 


32.6 


7.72 


0.11 


4.18 


G022.04+00.19 


26.2 


10.35 


0.10 


4.54 



No. RMS 

Sources 



Notes: Only a small portion of the data is provided here, 
the full table is only available in electronic form at the CDS 
via anonymous ftp to cdsarc.u-strasbg.fr (130.79.125.5) or via 
http://cdsweb.u-strasbg.fr/cgi-bin/qcat 7J/A+A/ 



ther twenty sources in this category, there are currently no archival 
HI data available at their Galactic latitudes, and thus, we cannot em- 
ploy eit her the HI self-absorp t ion or continuum absorption methods 
used by IRoman-Duval et al] d2009l) . However, the larger angular 
separation of these sources from the Galactic mid-plane would cor- 
respond to unrealistically large distances from the plane if a source 
were located at the far distance. 

In Fig.[3]we plot the height above the Galactic mid-plane for 
the 20 sources with \ b\ > 1°, corresponding to the near and far 
distances. In addition to the high latitude sources, we also show 
the distribution of heights of sources with assigned distances, and a 
scale that corresponds to 4 times the Galactic scale-height (see the 
discussion in Sect.[5). Comparing the heights of the RMS sources 
with assigned distances with those of the high latitude sources, it 
is very unlikely that any of them are located at the far distance. We 
have therefore assigned the near distance to these sources. 

Using a combination of association with molecular clouds of 
known distance, and applying the scale-height cutoff described in 
the previous paragraph, we have been able to assign distances to 
292 RMS sources from our initial sample of 326 located within 
the GRS longitude range. This corresponds to ~90 per cent of the 
sample. 
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A (/Ltm) 

Figure 4. An example of a SED fit to infrared and submillimetre fluxes. The 
flux values and their associated errors are shown as circles and vertical lines 
respectively, upper limits are shown as inverted triangles. The solid black 
line shows the best fit to the data while the grey lines trace the next ten best 
SED fits to the data. 



3.3 Luminosity 

Luminosities have been calculated for each source with an assigned 
distance by fitting a spectral energy distribution (SED) to a set of 
infrared to millimetre flux measurements. Near- and mid-infrared 
fluxes have been taken from the 2MASS and MSX point source 
catalogues. Far-infrared fluxes were obt ained from eithe r MIPS- 
GAL or the IRAS Galaxy Atlas (IGA; ICao et~ai1ll997l) images, 
using a s et of 2-D backgrou nd-fitting aperture photometry rou- 
tines (see lMottram et al.ll20ld for details). The MIPS GAL data are 
generally superior to the IRAS data although, in many cases, the 
MIPSGAL images were found to be saturated at the location of the 
RMS sources and could not be used. Fin ally, (sub)millimetre fluxe s 
were obtained from archival SCUBA 



SIMBA dFaundez etalJl2004l;lHilletal. 
and MAMBO dBeuther et al]2002h data 



Di Francesco et al 



20051; iBeltran et alj|2006l) 



20081) . 



SEDs were constructed for each sou rce using the model fitter 
produced bv lRobitaille et al]d2006l . [20071) . In Fig.[4]we present the 
measured fluxes and subsequent SED fit to the data obtained for the 
RMS source G025. 7161+00. 0486. Source luminosities were calcu- 
lated by integrating under the SED and multiplying by the square 
of the assigned distance. A detailed description of the photometry 
routines used to extract fluxes from the MIPSGAL and IRAS im- 
ages, and a full discussion of the procedures will be presented in an 
upcoming paper (Mottram et al. 2010, submitted). 

In Fig.|5]we present the luminosity distribution of all the RMS 
sources located within the GRS longitude range that have assigned 
distances. The left panel contains a histogram of the source lu- 
minosity function which peaks at approximately 10 4 L G , confirm- 
ing that we are finding a significant number of young massive 
stars. The latest stellar spectral type that can ionize its surround- 
ings and form an HII region is a B3 which has a correspondin 



mass of ~5.2Mb and a lu minosity of ~550 L Q dMartins et al.l200: 
iMevnet & Maedej|2000h , and thus, we define sources with lumi 
nosities larger than this as a massive star. We find that 290 of 
the 292 RMS sources (~99 per cent) with assigned distances have 
larger luminosities than would be expected from a B3 star — as- 
suming the observed luminosity arises from a single embedded star. 
Massive stars are known to form exclusively in clusters. However, 
IWood & Churchwelll ( fl98^) showed that for a realistic initial mass 



function the most massive member of a cluster is only a couple of 
spectral classes lower than that determined by assuming the lumi- 
nosity was due to a single star. Taking this into account we find that 
any RMS source with a luminosity greater than 10 4 L G (i.e., equiv- 
alent to a star of spectral type B 1 or earlier) is likely to contain a 
massive star. Applying this limit to identify a bona fide sample of 
massive young stars we find approximately two-thirds (193) of our 
sources have luminosities over 10 4 L Q , and are therefore luminous 
enough to host massive young stars. We note that these fractions 
only apply to the current sample since there is likely to be number 
of high mass stars that fall below our detection threshold. 

In the right panel of Fig.|5]we present a plot showing the dis- 
tribution of source luminosities as a function of heliocentric dis- 
tance along with the MSX 21/jm limiting sensitivity. The sensi- 
tivity has been estimated by first calculating the flux in the MSX 
21 ^m i (band E) using F E = 4.041 x 10" 14 5 2 i Wm 2 dCohen et all 
l200d) . where 5 2 i is the MSX 21 yum detection limit (-2.7 Jy). This 
value is then multiplied by a factor of 24, calculated from the ra- 
tio of bolometric fluxes, determined from SED fits, to the MSX 
band E fluxes (Mottram et al. 2010, submitted). The fairly homo- 
geneous distribution in heliocentric distance of RMS sources above 
10 4 L G , would suggest that our sample of young massive stars is 
complete for luminosities above this across the Galaxy. To prop- 
erly determine the completeness limit requires modelling of the 
luminosity function which is beyond the scope of this paper and 
will be the subject of a subsequent paper (Davies et al. 2010, in 
prep.), however, preliminary results are consistent with the com- 
pleteness level derived from Fig.[5]. In Table[3]we present a sum- 
mary of each RMS source located within the GRS longitude range 
including positions, associated GRS cloud name, assigned kine- 
matic distance, height above the mid-plane, Galactocentric distance 
and bolometric luminosity (only a small portion of the table is pro- 
vided here, the full table is only available in electronic form at the 
CDS via anonymous ftp to cdsarc.u-strasbg.fr (130.79.125.5) or via 
http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/). 



4 CLOUD PROPERTIES 

In this section we will discuss the Galactic distribution of molecu- 
lar gas and compare the physical properties of clouds found to be 
associated with the RMS sample of young massive stars. To facil- 
itate these discussions we need to c alculate a few key paramet ers 
currently unavai l able i n either of the lRoman-Duval et al. I d2009h or 
iRathborne et al.l d2009l) papers. 



4.1 Deriving cloud masses and physical sizes 

In this subsection we will estimate masses and physical radii for ev- 
ery cloud that has an unambiguous distanc e, and for which an exci - 
tation temperature has been calculated by IRathborne et all d2009h . 
The observed radius of each cloud has been esti mated using the 
projec ted area of the cloud on the sky, A, given by IRathborne et all 
d2009h via: 



(A\ 05 
R abs = (-) 

To correct for the spatial resolution we use the following: 



„2 / #FWHM \ 2 



(1) 



(2) 
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Table 3. Summary of RMS source parameters located within the GRS longitude range. 



MSX Name 




RA 


Dec. 


GRS Cloud Name 


Distance 


Height 


Rgc 


Luminosity 






(J2000) 


(J2000) 




(kpc) 


(pc) 


(kpc) 


Logio(L ) 


G017.9642+00.0798 


18:23:20.82 


-13:15:05.4 


G017.99+00.09 


2.27 


3.2 


6.4 


3.2 


G017.9789+00.2335 


1! 


!:22:49.08 


-13:09:59.0 


G017.99+00.09 


1.50 


6.1 


7.1 


2.6 


G018.1409-00.3021 


u 


!:25:04.46 


-13:16:26.7 


G018.24-00.41 


4.36 


-23.0 


4.6 


4.5 


G018.1504-00.2794 


1! 


!:25:00.62 


-13:15:18.3 


G018.24-00.41 


4.36 


-21.2 


4.6 


4.5 


G018.3029-00.3910 


Li 


!:25:42.48 


-13:10:20.2 


G018.34-00.36 


2.98 


-20.4 


5.7 


4.7 


G018.3412+01.7681 


Li 


5:17:58.15 


-12:07:24.9 




2.96 


91.3 


5.8 


4.4 


G018.3706-00.3818 


\i 


!:25:48.26 


-13:06:29.5 


G018.39-00.41 


3.67 


-24.5 


5.1 


3.5 


G018.6608+00.0372 


Li 


!:24:50.42 


-12:39:20.8 


G018.69+00.04 


10.86 


7.0 


3.9 


4.4 


G018.6696+01.9631 


Li 


i: 17:54.09 


-11:44:32.2 




2.95 


101.1 


5.8 


4.0 


G018.7621+00.2634 


Li 


5:24:12.91 


-12:27:37.8 


G018.64+00.29 


14.09 


64.8 


6.6 


5.3 


G018.8246-00.4673 


Li 


!:26:59.01 


-12:44:47.0 


G018.94-00.26 


4.60 


-37.5 


4.4 


3.9 


G018.83 19-00.4788 


1) 


5:27:02.35 


-12:44:43.0 


G018.89-00.51 


4.65 


-38.9 


4.4 


3.7 


G018.8330-00.3004 


tt 


5:26:23.64 


-12:39:40.3 


G018.69-00.06 


12.41 


-65.1 


5.2 


5.0 


G019.0741-00.2861 


Li 


5:26:48.14 


-12:26:28.3 


G019.09-00.26 


4.62 


-23.1 


4.4 


5.2 


G019.4902+00.1350 


1! 


5:26:04.26 


-11:52:36.1 


G019.39-00.01 


2.44 


5.8 


6.2 


3.7 


G019.6053-00.9012 


1! 


5:30:02.59 


-12:15:23.7 


G019.59-00.91 


3.07 


-48.2 


5.7 


3.9 


G019.6085-00.2357 


tt 


5:27:38.30 


-11:56:41.2 


G019.64-00.26 


12.57 


-51.7 


5.4 


5.7 


G019.6106-00.2531 


1) 


5:27:42.28 


-11:57:03.9 


G019.64-00.26 


12.57 


-55.5 


5.4 


4.4 


G019.7268-00.1132 


1! 


5:27:25.20 


-11:46:59.1 


G019.59-00.06 


11.67 


-23.1 


4.7 


4.5 


G019.7403+00.2799 


1) 


5:26:01.48 


-11:35:16.4 


G019.24+00.34 


1.90 


9.3 


6.7 


3.2 


G019.7540-00.1279 


Li 


5:27:31.48 


-11:45:56.8 


G019.59+00.09 


8.00 


-17.9 


2.9 


4.6 


G019.8817-00.5347 


Li 


5:29:14.35 


-11:50:30.1 


G019.89-00.56 


3.52 


-32.8 


5.3 


4.1 


G019.8922+00.1023 


1) 


5:26:57.28 


-11:32:10.6 












G019.9224-00.2577 


Li 


5:28:18.83 


-11:40:37.5 


G019.94-00.21 


4.59 


-20.6 


4.5 


3.3 


G019.9386-00.2079 


Li 


5:28:09.88 


-11:38:22.9 


G019.94-00.21 


4.59 


-16.6 


4.5 


3.4 


G020.080 1-00. 1360 


Li 


5:28:10.41 


-11:28:51.6 


G019.74-00.26 


12.62 


-30.0 


5.5 


5.7 


G020.5 143+00.4936 


1! 


5:26:43.51 


-10:48:12.6 


G020.49+00.49 










G020.7121-00.0754 


tt 


5:29:09.09 


-10:53:35.1 


G020.79-00.06 


11.66 


-15.3 


4.8 


4.4 


G020.7438-00.0952 


U 


5:29:16.96 


-10:52:27.1 


G020.79-00.06 


11.66 


-19.4 


4.8 


4.9 


G020.7491-00.0898 


U 


5:29:16.39 


-10:52:01.2 


G020.79-00.06 


11.66 


-18.3 


4.8 


5.1 



Notes: Only a small portion of the data is provided here, the full table is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr 
(130.79.125.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat 7J/A+A/ 



where Sfwhm is the effective beam size of the smoothed GRS cube. 

Rathborne et al. calculated the optical depths and exci- 

tation temperatures for all of the clouds by combining 12 CO (7=1- 
0) data from the Un iversity of Massachusetts-Stony Brook survey 
dSanders etalll 19861) and the 13 CO data from the GRS. They cal- 



culated typical optical depths and excitation temperatures of ~0.13 
and 9 K respectively. We can now use these results to estimate the 
mass of the individ ual clouds. Following the method outlined by 
iBourke et al.l dl997t) it is possible to estimate the total mass of a 
cloud using a single optically thin molecular transition and assum- 
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Figure 6. Left panel: Distribution of GRS cloud LTE masses. The bin size used is 0.5 dex. Right panel: LTE mass distribution as a function of heliocentric 
distance. In both plots the whole sample of GRS clouds is shown in grey while the GRS clouds associated with an RMS source with a luminosity greater than 
10 4 L Q are shown in red. 



ing an appropriate abundance ratio. Since only a single transition is 
used, the mass is determined assuming local thermodynamic equi- 
librium (LTE) and is referred to hereafter as the LTE mass (M LTE ). 
We use the following equation taken from lBourke et al ] dl997n : 



Mr 



A'n 



M 



[H 2 / 13 CO] (D_ 
kpc 



2.72m H 7 x 10 5 
0.312 



r m b dv dQ. 



(3) 



1 - exp(-5.29/r ex ) 

where the fi m is the mean molecular mass per IL molecule, ran 
is atomic mass of hydrogen, D is the heliocentric distance, v is in 
kins -1 and SI is the source solid angle in arcmin 2 . We have as- 
sumed an abundance ratio of H2 to 13 CO of 7 x 10 5 dFrerking et al] 
ll982tlBourkeetalj|l997l) a mean molecular mass of 1.36 ;;ih, tak- 
ing a ccount of the H e content o f of the gas. Dist ances were taken 
from | Roman-Duval et al] d2009l) and the T ex from lRathborne et al] 
d2009h . The / / r m b dv dfl term in Eqn. 3 is equal to the total inte- 
grated in tensity which has been calculated for each cloud and pre- 
sented in lRathborne et al]d2009h . We present the derived values for 
cloud LTE masses in Col. 5 of Table[2] 

We have estimated the LTE masses for 734 GRS clouds. In 
the left and right panels of Fig.[6]we present the distribution of LTE 
cloud mass and plot the distribution of cloud mass as a function 
of heliocentric distance. In both of these plots we show the whole 
catalogue of clouds in grey and indicate the clouds associated with 
RMS sources with a luminosity above 10 4 L Q in red. 

The mass distribution shown in the left panel of Fig.[6] peaks 
at masses ~10 4 ~ 4 ' 5 M Q , which is smaller than the masses gen- 
erally reported for giant molecular clouds (GMCs; ~10 5 ~ 6 M Q ; 
ISolomonetalJl987h . The lower masses derived for the GRS clouds 
are consequence of using the weaker 13 CO transition rather than 
the 12 CO transition normally used to determine the properties of 
GMCs. We are therefore not tracing the full extent of the GMCs and 
missing a significant amount of emission from the more extended 
low-density envelope. We can obtain an estimate of the proportion 
of the GMCs traced by the 13 CO emission by making a few simple 
assumptions. In the lower panel of Fig. 7 we present a histogram of 
the mass density per kpc~ 2 as a function of galactocentric radius. 



Assuming that the region sampled by the GRS is representative of 
the Galaxy as a whole and integrating the emission over In we esti- 
mate the total mass of molecular gas would be 6.3 x 10 7 M Q . Com- 
paring the mass estimated fr om the 13 CO emission w ith the value 
of ~ 9 x 10 s M Q derived by iBlitz & Williams! d 19991) for the total 
mass i n GMCs (based on work by Damd 1 ~993l) and lHunter et al] 
d 19971) reveals that only a small fraction (~7 per cent) of the entire 
GMC has sufficient column density to have been detected by the 
GRS. 



4.2 Galactic distribution of molecular gas 

In this subsection we use the cloud LTE masses derived in the previ- 
ous subsection to investigate the distribution of molecular material 
as a function of Galactic longitude and Galactocentric radius. In 
Fig. [7] we present two histograms showing these relationships for 
all of the GRS clouds (grey) and the subsample of clouds associ- 
ated with young massive stars above the RMS completeness limit 
(red). In the upper panel we show the total integrated LTE mass as a 
function of Galactic longitude, while in the lower panel we present 
the mass surface density distribution as a function of Galactocen- 
tric radius to remove any bias due to the uneven areal coverage of 
the GRS. 

The distribution of LTE mass as a function of Galactic Lon- 
gitude (upper panel of Fig.[7]( for the GRS catalogue and the sub- 
sample associated with RMS sources reveal two distinct peaks, the 
strongest of the two is located at / ~ 20-25°, with the second peak 
located at / ~ 30-35°. The these peaks correspond with the concen- 
tration molecular gas coincident with the end of the Galactic bar 
and the line of sight along the tangent of the Scutum arm. These 
peaks also coincide with two peaks seen i n the longitude distribu - 
tion of the old star population reported bv lBeniamin et al] d2005h . 
It is also worth noting that there is no evidence of a peak in the LTE 
mass distribution at the expected longitude of the Sagittarius arm 
tangent (/ ~ 46 — 50°). The similar absence of a peak in the dis- 
tribution of the old star population was noted bv lBeniamin] d2008h 
and has led to speculation that the Galaxy has two principle spiral 
arms, Scutum-Centaurus and Perseus, and two secondary or minor 
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Figure 7. Distribution of LTE mass as a function of Galactic longitude and 
mass surface density as a function of Galactocentric radius. The whole sam- 
ple of GRS clouds is shown in grey while the GRS clouds associated with 
an RMS source with a luminosity greater than 10 4 L Q are shown in red. 

arms teeniamij200ollChurchwell et al]2009l) . We will investigate 
this in more detail in the next section. 

In the lower panel Fig. [7] we present the mass surface density 
distribution as a function of Galactocentric radius. This distribution 
reveals three peaks, the strongest is ~4- 4.5 kpc, a slightly weaker 
peak ~6-6.5 kpc, and one at ~7.5-8 kpc. These peaks correspond 
to the expected galactocentric radii of the Scutum, Sagittarius and 
Perseus arms. 

4.3 Properties of RMS-GRS cloud associations 

In the previous section we identified the molecular clouds involved 
in the formation of young massive stars by matching the RMS 
sources with the GRS clouds. In this section we will compare 
the properties of GRS clouds that are associated with the forma- 
tion of the next generation of young massive stars, and those that 
are not associated. In the right panels of Figs. 6 and 7 we present 
plots showing the distribution of RMS source luminosities and 
GRS cloud LTE masses as a function heliocentric distance; these 
plots show that we are virtually complete to young massive stars 
and GRS clouds above ~10 4 L G and ~ 10 3 M e , respectively. In total 
iRathborne et"aH j2009t) identified 829 molecular clouds of which 
595 have masses greater than ~10 3 M Q . Of these, 129 clouds are 



also associated with an RMS source with a luminosities greater 
than ~10 4 L o ; in the following analysis we refer to these clouds 
as massive star forming clouds and the remaining 464 clouds as 
non-massive star forming clouds. 

In Fig.[8]we present normalised histograms and cumulative 
distribution function (CDF) plots for a number of physical param- 
eters for both the massive star forming clouds (red) and the non- 
massive star forming clouds (grey). These histograms and CDF 
plots clearly illustrate the significant differences between the two 
samples of clouds. 

We used Kolmorgorov-Smirnov (K-S) tests to evaluate the sig- 
nificance of the parametric differences between the two cloud sam- 
ples. We are able to reject the null hypothesis that the two popu- 
lation are drawn from the same parent population with confidence 
values above 3<x for all of the parameters mentioned in the previous 
paragraph. The only parameter that is not significantly different be- 
tween the two cloud samples is the optical depth (t). In Table|4]we 
present a summary of the averages, minimum and maximum values 
and the result of Kolmorgorov-Smirnov (K-S) tests comparing the 
massive star forming and non-massive star forming clouds. 

The clouds associated with massive star formations are in 
generally larger, more massive and, perhaps not suprisingly, are 
found to have significantly higher column densities. Furthermore, 
the massive star forming clouds are warmer and more turbulent as 
indicated by the higher excitation temperatures and larger velocity 
dispersion than found towards the non-massive star forming clouds. 
The only parameter that is not significantly different between the 
two cloud samples is the optical depth (t). 



5 GALACTIC DISTRIBUTION OF YOUNG OB STARS 

In this section we will use the distance and luminosity results 
obtained previously to investigate the spatial distribution of mas- 
sive stars in the Galaxy. Regions of massive star formation are al- 
most exclusively found to be associat ed with the spira l arms where 
molecular clouds are thought to form jKennicuttll2005h . The Galac- 
tic distribution of massive young stars is, therefore, an important 
probe of Galactic structure. 

In Fig.[9]we present two plots showing the Galactic distribu- 
tion of our complete sample of young massive stars. In both plots 
the positions of the RMS sources (L Bol ^ 10 4 L Q ) are indicated 
by filled red circles, the sizes of which provides an indication of 
their bolometric luminosities. For RMS sources that have been as- 
sociated with a molecular cloud we have assumed the RMS source 
distance is the same as its host cloud. Using the systemic velocity 
of the cloud smooths out localised velocity perturbations that might 
lead to larger scatter in the distances. Distan ces for the high l atitud e 
sources (i.e., \b\ > 1°) have been taken from lUrquhart et aT] ( feOOot ). 

The left panel of this figure simply presents the positions of 
the sample without the addition of Galactic features to lead the eye. 
Examining the distribution shown in this plot there are no immedi- 
ately obvious structures, however, one can begin to see structures 
that suggest the presence of spiral arms between the locus of tan- 
gent points and the solar circle. 

In the right panel of Fig.[9] we plot the positions of the 
RMS sources over an image of the Galaxy produced by Robert 
Hurt of the Spitzer Science Center in consultation with Robert 
Benjamin (University of Wisconsin- Whitewater). This image at- 
tempts to synthesise all that has been learnt about Galactic struc- 
ture over the past fifty years including: a 3. 1-3.5 kpc Galactic Bar 
at an angle of 20° with respect to the Galactic Centre-sun axis 
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Figure 8. Histograms for a selection of derived parameters chosen to show the distribution of physical parameters of clouds associated with the formation of 
massive stars (red) and the non-massive star forming clouds (grey). 



Table 4. Summary of general parameters derived from the GRS clouds with masses above 10 3 M Q . We have separated this sample of clouds into two groups; 
those found to be associated with a young massive star above the RMS completeness limit (i.e., LboI. > 10 4 L Q ) are classified as massive star forming with all 
other clouds being classified as non-massive star forming clouds. 



Parameter 




Non-Massive Star Forming Clouds 




Massive Star Forming Clouds 




K-S test 


Min 


Max 


Mean 


Median 


Std. dev. 


Min 


Max 


Mean 


Median 


Std. dev. 


Log[LTE Mass (M Q )] 


3.0 


5.4 


4.1 


4.1 


0.6 


3.1 


5.6 


4.6 


4.6 


0.5 


« 0.01 


Log[Cloud Radii (pc)] 
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( iBinnev et alj 1 199 lb iBlitz & SpergeH Il99ll ; bwek et alj Il995l) . a 
second non-axisymmetri c structure referred to as the "Long Bar" 
dHammerslev et alj2000|) with a Galactic rad ius of 4.4 ± 0.5 kpc at 
an angle of 44° ±10° dBeniamin et alj|2005h . the Near and Far 3- 
kpc arms, and the four principle arms: Norma, Sagittarius, Perseus 
and Scutum-Centaurus . The p osition of the arms is based on the 
iGeorgelin & Georgelinl dl976t) model which has been modified to 
incorporate Very Long Bas eline Array maser parallax measure- 
ments (e.g., IXu et al 20061) and refin ed directions for the spiral 
arm tangents from Dame et al] d200ll) . The Perseus and Scutum- 
Centaurus arms have been emphasised in this image to reflect the 
overdensities seen in the old stellar disk population towards their 



expected Galactic long itudes tangent positions dBeniamir] 120081 ; 
IChurchwell et al.ll2009l) . 



Comparing the distribution of our sample of young massive 
stars with the arms shown in the right panel of Fig.|9]reveals them 
to be strongly correlated. The highest density of massive stars is co- 
incident with the end of the Galactic bar and the proposed location 
of the start of the Scutum-Centaurus arm. The high concentration 
of young massive stars towards the end of the bar also correlates 
with the strongest peak in the gas mass distribution (as traced by 
the I3 CO luminosity; see iRoman-Duval et alj [20091 for more de- 
tails). Both distributions peak approximately at the leading edge of 
the bar, falling off steeply in the direction of rotation, and tailing 
off behind the bar. This high density of massive stars is also coin- 
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Figure 9. Galactic distribution of the complete RMS sample of MYSOs and UC HII regions with bolometric luminosites ^ 10 4 L o in the GRS field. In both 
panels we show the kinematic positions of our sample as red-in-black circles, the sizes of which give an indication of their respective luminosities. In the upper 
right corner of each plot we give the luminosities for a sample of zero age main sequence stars. In the left panel we plot the positions of our sample without 
any Galactic structure information to lead the eye. In the right panel we have superimposed the RMS source distribution over a sketch of how the Galaxy is 
thought to appear if viewed face-on. This image has been produced by Robert Hurt of the Spitzer Science Center in consultation with Robert Benjamin and 
attempts to synthesise many of the key elements of Galactic structure using the best data currently available (see text for more details). The position of the Sun 
is shown by the small circle above the image centre. The Roman numerals in the corners refer to the Galactic quadrants and the two thick lines originating 
from the location of the Sun enclose the region of the Galactic Plane covered by the GRS. The dot-dashed circles represent the locus of tangent points and the 
Solar Circle. 



cident with the co-rotation radius (~4.5 kpc). It is therefore unclear 
whether the high density of massive stars is the result of dynamical 
interaction between the bar and the spiral arms or co-rotation. 

We also note the correlation between the massive young stars 
and the Sagittarius and Perseus arms, this is particularly strong to- 
wards the Perseus arm. We note that the correlation of RMS sources 
with the far section of the Sagittarius arm is not as strong as seen 
towards the near section, with a number of sources lying between 
the Sagittarius and Perseus arms in the model. This may suggest 
that the model needs small adjustments or that we have a small 
systematic error in these sources. Comparing the position of the 
tangent-circle sources and the far 3 kpc arm we see correlation be- 
tween them, which suggests they are associated. In Fig.[T0]we show 
the source density per kpc~ 2 of young massive stars as a function 
of Galactocentric radius. This plot clearly reveals three significant 
peaks at approximately 4, 6 and 8 kpc. Inspection of the spatial 
distribution plot (Fig. 9) shows that the strongest peak at 4 kpc 
coincides with the intersection of the Long Bar and the Scutum- 
Centaurus arm, and that the second and third peaks are at the Galac- 
tocentric radii of the Sagittarius and Perseus arms, respectively. The 
distribution with Galactocentric radius is only dependent on the 
Galactic rotation curve and not on the solution of near-far ambi- 
guity. Therefore, the correlation of these peaks with the structures 
in the spiral-arm model lends support to their existence. 

A similar Galactocentric radial distribution is seen in the 



CO data pre s ented in Fig. 7 and has been reported for 12 CO by 
Clemens et al.l dl988h an d for a large sample of HII regions by 



Anderson & Bania i 2009h . however, in the case of the HII regions 
they do not find a peak at ~ 8 kpc. It is not clear why there is 
a peak in the distribution of young massive st ars at this distance 
from t he Galactic centre which is not seen in the l Anderson & Banial 
j2009h sample of HII regions. However, there is evidence of stream- 
ing motions in HI data at all three Galactocentric radii, which 
are a strong indication these are the locations of spiral arms 
dMcClure-Griffiths & Dickevll2007l) . 

In a recent study of IRDCs ( J ackson et alj2008h reported find- 
ing a significant concentration of sources at ~8 kpc in the 1st Quad- 
rant. Given than the identification of IRDCs is biased towards more 
nearby clouds the authors assumed their sources were all located at 
the near kinematic distance, and thus, the observed source enhance- 
ment was due to a h i gh de nsity of sources in the solar neighbour- 
hood. Ijackson et al.l 1 20081) consider the possibility that the peak 
could be due to sources located at the far side of the Galaxy, pos- 
sibly associated with a distant spiral arm, but conclude that this is 
less likely. We can rule out any possibility that the peak found at ~ 
8 kpc in our sample is due to high concentrations of sources located 
in the vicinity of the Sun since the luminosity cut has filtered out 
nearby low- and intermediate-mass objects. The correlation of the 
~8kpc peaks in both the IRDCs and our sample of massive stars 
suggests that the peak of IRDCs seen at 8 kpc may in part be due to 
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Figure 10. Histogram showing the surface density distribution of young 
massive stars as a function of Galactocentric radius. Errors have been cal- 
culated assuming Poisson statistics (i.e., *<J~N where N is the number of 
sources in each bin). 



Figure 11. Distribution of RMS sources as a function of Galactic longitude. 
The histogram of the whole sample of young massive stars is shown in red 
with sources associated with the Sagittarius arm shown in yellow. Errors 
have been calculated assuming Poisson statistics. 



a high concentration of IRDCs associated with the Perseus arm at 
the far side of the Galaxy. 

The positional coincidence of a large number of young mas- 
sive stars with the proposed location of the Sagittarius arm seen 
in Figs. 9 and 10 provide convincing evidence that a significant 
amount of star formation is taking place within this arm. Inspec- 
tion of the source density would suggest there are similar levels of 
star formation associated with all three spiral arms in this Galactic 
quadrant. These results are consistent with a model of the Galaxy 
consisting of four principle arms. However, these results appear to 
be in contradiction w ith the results of the 4.5 fan number counts 
jBeniamin et al J [20051) that trace the longitude distribution of the 
old stellar disc populations; these data fail show any evidence of an 
overdensity in the old stellar disc population at the expected tangent 
direction of the Sagittarius arm (/ = 46-50°). 

The tangent direction of the Sagittarius arm lies only a couple 
of kpc in front of the Perseus arm and it is possible that the over- 
density is being masked by a high stellar background associated 
with the Perseus arm. This is impossible to test using the old stellar 
disk population since there is no velocity data available. However, 
we can test this hypothesis using the RMS sample. In Fig.ll II we 
present a plot of the source density as a function of Galactic lon- 
gitude for all sources with luminosities > 10 4 L G (red histogram) 
and for the subset of these located between galactocentric radii of 
5.5 and 6.5 kpc (yellow histogram); this annulus effectively isolates 
sources associated with the Sagittarius arm. Inspection of the distri- 
bution of the whole sample reveals no evidence of an enhancement 
in the source density at the Sagittarius tangent. However, looking at 
the distribution of the subset of sources associated with the Sagit- 
tarius radius, there is an obvious increase in the source density to- 
wards the expected tangent position. The small numbers of sources 
in each longitude bin and, consequently, the relatively large errors 
make it hard to arrive at a definitive conclusion. However, this anal- 
ysis shows that background contamination is a viable explanation. 



we present a histogram showing distribution of all RMS sources 
within the solar circle (2.5 kpc < RGC < 8.5 kpc). The distribution 
of the whole sample peaks between 0-20 pc above the plane, which 
is consistent with the value found by other authors teeedl [20061 ; 
and references therein). The latitude distribution is not particularly 
symmetrical, falling off more quickly above the plane than below. 
Thus, the scale height has slightly different values above and below 
the plane the scale heights are ~ 17 and 42 pc respectively. The skew 
is probably the resu lt of our vie wing angle from a distance above 
the plane of ~ 20 pc l lReedl2006h . Fitting a scale height to the whole 
data set we obtain an average value of 2 9±0.5 pc, wh ich agrees ex- 
tremely well with the value measured by Reed ( 2000) from the dis- 
tribution of nearby main sequence OB stars. 

In the lower three panels of Fig.[l2]we present histograms of 
the latitude distribution as a function of galactocentric radius (Rgc)- 
These plots reveal the Galactic latitude distribution to be dependent 
on Rqc, with the scale height increasing with distance from the 
Galactic centre. The scale height increases from ~30pc between 
2.5 kpc < R G c < 4.5 kpc to ~45 pc between 6.5 kpc < R G c < 
8.5 kpc (see panels for scale heights obtained for the different galac- 
tocentric bins and their associated uncertainties). This increase in 
scale height with radius has also been seen in s tudies of the HI 
emission within t he Milky Wa y l lMalhotra|[T995l) and in external 
edge-on galaxies jRupen|[l99 ll) . The increase in the scale height as 
a function of galactocentric radius of both the young massive stars 
and HI is approximately linear. However, the scale height of the HI 
disk increases twice as quickly (from ~ 100 pc and ~220 pc between 
3 and 8 kpc). 

In addition to the correlation between stellar scale height and 
disc radius, these plots also reveal that the centroid of the scale 
height oscillates above and below the Galactic midplane. This is 
agai n broadly in line with the distributions of both CO and HI emis- 
sion iMalhotral 19951 and references therein). 



5.1 Galactic latitude distribution 

In Fig.[l2]we plot the distribution of young massive stars as a func- 
tion of distance from the Galactic mid-plane. In the upper panel 



6 SUMMARY 

We have derived distances and luminosities to a large sample of 
MYSO candidates and UCHII regions identified by the Red MSX 
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Figure 12. The Galactocentric latitude distributions of the RMS sources 
plotted in the left panel. Errors have been calculated assuming Poisson 
statistics. 



Source (RMS) survey. Distances to the majority of the sample 
have been obtained by cross-correlating with a s ample of molec- 
ular c louds (i.e., \b\ <1° and 17.8° < / < 54.4° ; iRathborne et all 
l2009h for which t he distance ambiguities ha ve been resolved using 
archival HI data dRoman-Duval et ak 1 12003) . In this way, we have 
obtained kinematic distances to 272 sources. Using these, we have 
calculated the Galactic scale height for MYSOs and applied a 4<r 
cut in order to break the distance ambiguity to high latitude sources 
located within the same longitude range (i.e., \b\ >1° and 17.8° 
< I < 54.4°). These two steps have provided kinematic distances 
to 292 sources out of a possible 326 located within this longitude 
range which corresponds to ~90 per cent of our sample. 

We used these assigned distances in conjunction with SED fits 
to calculate RMS source luminosities. We find 290 RMS sources 
have luminosities large enough to indicate the presence of a ZAMS 
star with a spectral type of B3 or earlier. From analysis of the lu- 
minosity distribution as a function of heliocentric distance, we es- 



timate that our sample of young massive stars is complete for lumi- 
nosities greater than ~ 10 4 L G out to a distance of ~ 15 kpc. In total 
we have identified 193 RMS sources with luminosities consistent 
with the presence of young massive stars above this completeness 
limit. 

We estimate the masses of 734 of the clouds in the GRS 
catalogue and, examining the mass-heliocentric distance distri- 
bution, we estimate that the catalogue is complete to clouds 
with masses greater than 10 3 M Q within ~ 15 kpc. Selecting only 
clouds above this completeness limit and comparing the proper- 
ties of those associated with young massive stars (i.e., > 10 4 L G ) 
with the remainder of the clouds, we find significant differences. 
The clouds involved in massive star formation are, in general, 
larger, have higher column densities, have significantly larger 
masses and are more turbulent. Typical values for these parame- 
ters are radius^60 pc, log(N(H 2 ))-22 cm -2 , mass— 2.5x 10 4 M Q and 
FWHM Vlsr— 4.3 km s _1 . 

The Galactocentric distribution of massive stars and molec- 
ular gas reveals strong peaks at approximately 4, 6 and 
8 kpc that correlate with streami ng motions seen in HI data 
tMcClure-Griffiths & Dickevll2007h . which is a strong indication 
that the mass and stellar density enhancements are associated with 
the spiral arms. Furthermore, the Galactic distribution of these 
young massive stars is found to be spatially correlated with the 
proposed locations of the Scutum, Sagittarius and Perseus spiral 
arms. We find a high concentration of stars with the location of 
the end of the Galactic bar and the Scutum spiral arm; this is 
also coincident with a brig ht region of molecular gas reported by 
iRoman-Duval et al There is a strong positional correlation 

between the massive stars and the Perseus and Sagittarius arms. 
We find similar levels of massive star formation associated with all 
three spiral arms. These results are consistent with a model of the 
Galaxy consisting of four principle arms. 

Using the latitude distribution of massive stars, we calculate 
the Galactic scale height for the whole sample of young massive 
stars to be ~ 29 ± 0.5 pc , which is in excellent agreement with the 
value reported by iReedl d2000t) derived from a sample of nearby 
main sequence OB stars. However, measuring the scale height of 
the sample as a function of galactocentric radius we find the scale 
height increases with increasing distance from the Galactic centre. 

In this paper we derive distances and luminosities to a sam- 
ple of ~300 MYSOs and UCHII regions identified from our pro- 
gramme of follow-up observations designed to examine the global 
characteristics of this galaxy-wide sample of massive young stars. 
Analysis of HI data is currently underway to derive distances and 
luminosities for a further ~1000 sources; these results will be pre- 
sented in a future publication. If the results found for the sample 
presented here are indicative of the remaining sources the RMS 
survey will fulfill its promise of returning the largest, well-selected, 
and Galaxy-wide sample of young massive stars yet produced. 
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